Page 1 of 8 

RECEIVED 

AUG 0 1 2QQ2 

Weans vumno 




RAW SEQUENCE LISTING DATE : 07/24/2002 

PATENT APPLICATION: US/09/586, 106B TIME: 11:51:58 

Input Set : A:\08411-032001.txt 

Output Set: N:\CRF3\07242002\I586106B.raw UV) 

3 <110> APPLICANT: Wright, David A. 

4 Voytas, Daniel F. 

6 <120> TITLE OF INVENTION: Plant Retroelements and Methods Related Thereto 
8 <130> FILE REFERENCE: P-1065A 

10 <140> CURRENT APPLICATION NUMBER: 09/586,1066 

11 <141> CURRENT FILING DATE: 2000-06-02 

13 <150> PRIOR APPLICATION NUMBER: 60/087,125 

14 <151> PRIOR FILING DATE: 1998-05-29 

16 <150> PRIOR APPLICATION NUMBER : 09/322,478 

17 <151> PRIOR FILING DATE: 1999-05-28 CMTPRPn 
19 <160> NUMBER OF SEQ ID NOS : 169 £ |M I C It I— 

21 <170> SOFTWARE: Patentln Ver. 2.1 

23 <210> SEQ ID NO: 1 

24 <211> LENGTH: 18 

25 <212> TYPE: DNA 

26 <213> ORGANISM: Glycine max 

28 <400> SEQUENCE: 1 

29 tggcgccgtt gecaattg 

32 <210> SEQ ID NO: 2 

33 <211> LENGTH: 18 

34 <212> TYPE: DNA 

35 <213> ORGANISM: Glycine max 
3 7 <400> SEQUENCE: 2 

3 8 tggcgccgtt gtcgggga 

41 <210> SEQ ID NO: 3 

42 <211> LENGTH: 6 

43 <212> TYPE: DNA 

44 <213> ORGANISM: Glycine max 

4 6 <400> SEQUENCE: 3 
47 ttgggg 

50 <210> SEQ ID NO: 4 

51 <211> LENGTH: 7 

52 <212> TYPE: PRT 

53 <213> ORGANISM: Artificial Sequence 

55 <220> FEATURE: 

56 <223> OTHER INFORMATION: Description of Artificial Sequence: plant, 

57 retroelement sequence 

59 <400> SEQUENCE: 4 

60 Met Ala Ser Arg Lys Arg Lys 

61 1 5 

64 <210> SEQ ID NO: 5 

65 <211> LENGTH: 1263 

66 <212> TYPE: DNA 



file://C:\Crf3\Outhold\VsrI586106B.htm 



7/24/02 



Page 2 of 8 



DATE: 07/24/2002 
TIME: 11:51:58 



AUG 



01 



7 ^0 



plant 



67 <213> ORGANISM: Artificial Sequence 

% lilt OTHER^IMFORMATIOH : Description of Artificiai Sequence, 

71 retroelement sequence 

73 <400> SEQUENCE: 5 acacccqqgq aagcgtccaa ctgggactct 60 

74 atggcctccc gtaaacgcaa agctgtgocc acacccgggg ^ a gctccggaac 120 

75 tcacqtttca ctttcgagat tgcttggcac aga r» ttaatgagtt cctgcaggaa 180 
atccttccag agaggaatgt agagcttgga ccagggatgt ttga g^g gattgatgtt 40 

77 ctccagaggc tcagatggga ccaggttctg ^cgact agg accacag tccgaagttt 300 

78 gctctggtga aggagtttta ctccaaccta tatgatcc g J*^^ tttcctcgac 60 

79 tggagtgttc gaggacaggt tgtgagattt g g y act ctcagta cctcagcact 420 

80 accccggtca tcttggcaga ^ ^tactc cagggggacg atttgttctg 480 

81 cctccagacc atgatgccat cctttccgct "9 9 tgacgc t cgcgcagaca 540 

82 aatgttgata gtgccccctg gaagctgctg ^aaggatc g J tattaatgtt 600 

83 tggagtgtgc tctcttattt tajccttgca ^tttc acctggacgt gggcagcctc 

84 gacagggccc gactcaatta tg**tggtg atg g g ccaggcttgg gt tcccagcg 720 

85 atttctcttc agatcagtca gatcgcccag ^ at accctgat ttttgagtca 780 

86 ttgatcacaa cactgtgtga ? attcagggg ^gtctctg ^ tgccgatcca 840 

87 ctcagtcctg tgatcaacct tgcctacatt aag g cttC ggcgtc ggcatctgag 900 

88 tctatcacat ttcaggggac ccgccgcacg cgcaccag g gC ctccactt 960 

89 gctcctcttc catcccagca tccttctcag cctttttccc agag g gtaccagggt 1020 

90 ctatccacct cagcacctcc atacatgcat gacagatg J ggatctgcca 1080 

5 SSSS 5» SJ 1 ~ 

S SS c~ MS -™ " 9M9cagc I 2 ," 

95 tga 

98 <210> SEQ ID NO: 6 

99 <211> LENGTH: 421 

100 <212> TYPE: PRT 

Ml <213> ORGANISM: Artificial Sequence 

103 <220> FEATURE: , nHnn of Artificial Sequence 

104 <223> OTHER INFORMATION : Description of Art 

105 retroelement sequence 

107 <400> SEQUENCE: 6 _ h p Gly Glu A la 

108 Met Ala Ser Arg Lys Arg Lys Ala Val Pro 15 

1° 9 1 o 5 1M Phe Thr Phe Glu He Ala Trp His Arg Tyr 

111 Asn Trp Asp Ser Ser Arg Phe Thr Pne 3Q 

112 T ?° nn Leu Arg Asn He Leu Pro Glu Arg Asn Val Glu 
114 Gin Asp Ser He Gin Leu Arg 45 

S «. «T p" «T P*e Asp OXU Phe L eu Cin CI. ,eu Gin Ar g ,eu 

\ll Aru t" asp Sin V.l ,eu »r *, ,eu Pro «. «» »P ™J 

121 65 ™ Asn Leu Tyr Asp Pro Glu Asp His 

123 Ala Leu Val Lys Glu Phe Tyr Ser Asn ^ g5 

124 85 „ „„, ara civ Gin Val Val Arg Phe Asp Ala 

126 Ser Pro Lys Phe Trp Ser Val Arg Gly 11Q 

127 WO 



<TER 



too? 



W00/2c C: 



plant 



Ser 
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129 Glu Thr lie Asn Asp Phe Leu Asp Thr Pro Val lie Leu Ala Glu Gly 

130 115 120 125 



132 


Glu 


Asp 


Tyr 


Pro 


Ala 


Tyr 


Ser 


Gin 


Tyr 


Leu 


Ser 


Thr 


Pro 


Pro 


Asp 


His 


133 




130 










135 










140 










135 


Asp 


Ala 


He 


Leu 


Ser 


Ala 


Leu 


Cys 


Thr 


Pro 


Gly 


Gly 


Arg 


Phe 


Val 


Leu 


136 


145 










150 










155 










160 


138 


Asn 


Val 


Asp 


Ser 


Ala 


Pro 


Trp 


Lys 


Leu 


Leu 


Arg 


Lys 


Asp 


Leu 


Met 


Thr 


139 










165 










170 










175 




141 


Leu 


Ala 


Gin 


Thr 


Trp 


Ser 


Val 


Leu 


Ser 


Tyr 


Phe 


Asn 


Leu 


Ala 


Leu 


Thr 


142 








180 










185 










190 






144 


Phe 


His 


Thr 


Ser 


Asp 


He 


Asn 


Val 


Asp 


Arg 


Ala 


Arg 


Leu 


Asn 


Tyr 


Gly 


145 






195 










200 










205 








147 


Leu 


Val 


Met 


Lys 


Met 


Asp 


Leu 


Asp 


Val 


Gly 


Ser 


Leu 


He 


Ser 


Leu 


Gin 


148 




210 










215 










220 










150 


He 


Ser 


Gin 


He 


Ala 


Gin 


Ser 


He 


Thr 


Ser 


Arg 


Leu 


Gly 


Phe 


Pro 


Ala 


151 


225 










230 










235 










240 


153 


Leu 


He 


Thr 


Thr 


Leu 


Cys 


Glu 


He 


Gin 


Gly 


Val 


Val 


Ser 


Asp 


Thr 


Leu 


154 










245 










250 










255 




156 


He 


Phe 


Glu 


Ser 


Leu 


Ser 


Pro 


Val 


He 


Asn 


Leu 


Ala 


Tyr 


He 


Lys 


Lys 


157 








260 










265 










270 






159 


Asn 


Cys 


Trp 


Asn 


Pro 


Ala 


Asp 


Pro 


Ser 


lie 


Thr 


Phe 


Gin 


Gly 


Thr 


Arg 


160 






275 










280 










285 








162 


Arg 


Thr 


Arg 


Thr 


Arg 


Ala 


Ser 


Ala 


Ser 


Ala 


Ser 


Glu 


Ala 


Pro 


Leu 


Pro 


163 




290 










295 










300 










165 


Ser 


Gin 


His 


Pro 


Ser 


Gin 


Pro 


Phe 


Ser 


Gin 


Arg 


Pro 


Arg 


Pro 


Pro 


Leu 


166 


305 










310 










315 










320 


168 


Leu 


Ser 


Thr 


Ser 


Ala 


Pro 


Pro 


Tyr 


Met 


His 


Gly 


Gin 


Met 


Leu 


Arg 


Ser 


169 










325 










330 










335 




171 


Leu 


Tyr 


Gin 


Gly 


Gin 


Gin 


He 


He 


He 


Gin 


Asn 


Leu 


Tyr 


Arg 


Leu 


Ser 


172 






340 










345 










350 






174 


Leu 


His 


Leu 


Gin 


Met 


Asp 


Leu 


Pro 


Leu 


Met 


Thr 


Pro 


Glu 


Ala 


Tyr 


Arg 


175 






355 










360 










365 








177 


Gin 


Gin 


Val 


Ala 


Lys 


Leu 


Gly 


Asp 


Gin 


Pro 


Ser 


Thr 


Asp 


Arg 


Gly Glu 


178 




370 










375 










380 










180 


Glu 


Pro 


Ser Gly 


Ala 


Ala 


Ala 


Thr 


Glu 


Asp 


Pro 


Ala 


Val 


Asp 


Glu 


Asp 


181 


385 










390 










395 










400 


183 


Leu 


He 


Ala 


Asp 


Leu 


Ala 


Gly 


Ala 


Asp 


Trp 


Ser 


Pro 


Trp 


Ala 


Asp 


Leu 


184 










405 










410 










415 




186 


Gly Arg Gly 


Ser 


Glx 

























187 420 

190 <210> SEQ ID NO: 7 

191 <211> LENGTH: 1596 

192 <212> TYPE: DNA 

193 <213> ORGANISM: Artificial Sequence 

195 <220> FEATURE: 

196 <223> OTHER INFORMATION: Description of Artificial Sequence: plant 

197 retroelement sequence 

199 <400> SEQUENCE: 7 

200 atgcgaggta gaactgcatc tggagacgtt gttcctatta acttagaaat tgaagctacg 60 
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201 tgtcggcgta acaacgctgc aagaagaaga agggagcaag acatagaagg aagtagttac 120 

202 acctcacctc ctccttctcc aaattatgct cagatggacg gggaaccggc acaaagagtc 180 

203 acactagagg acttctctaa taccaccact cctcagttct ttacaagtat cacaaggccg 240 

204 gaagtccaag cagatctcct tactcaaggg aacctcttcc atggtcttcc aaatgaagat 300 

205 ccatatgcgc atctagcctc atacatagag atatgcagca ccgttaaaat cgccggagtt 3 60 

206 ccaaaagatg cgatactcct taacctcttt tccttttccc tagcaggaga ggcaaaaaga 420 

207 tggttgcact cctttaaagg caatagctta agaacatggg aagaagtagt ggaaaaattc 480 

208 ttaaagaagt atttcccaga gtcaaagacc gtcgaacgaa agatggagat ttcttatttc 540 

209 catcaatttc tggatgaatc ccttagcgaa gcactagacc atttccacgg attgctaaga 600 

210 aaaacaccaa cacacagata cagcgagcca gtacaactaa acatattcat cgatgacttg 660 

211 caactcttaa tcgaaacagc tactagaggg aagatcaagc tgaagactcc cgaagaagcg 720 

212 atggagctcg tcgagaacat ggcggctagc gatcaagcaa tccttcatga tcacacttat 780 

213 gttcccacaa aaagaagcct cttggagctt agcacgcagg acgcaacttt ggtacaaaac 840 

214 aagctgttga cgaggcagat agaagccctc atcgaaaccc tcagcaagct gcctcaacaa 900 

215 ttacaagcga taagttcttc ccactcttct gttttgcagg tagaagaatg ccccacatgc 960 

216 agagggacac atgagcctgg acaatgtgca agccaacaag acccctctcg tgaagtaaat 1020 

217 tatataggca tactaaatcg ttacggattt cagggctaca accagggaaa tccatctgga 1080 

218 ttcaatcaag gggcaacaag atttaatcac gagccaccgg ggtttaatca aggaagaaac 114 0 

219 ttcatgcaag gctcaagttg gacgaataaa ggaaatcaat ataaggagca aaggaaccaa 1200 

220 ccaccatacc agccaccata ccagcaccct agccaaggtc cgaatcagca agaaaagccc 1260 

221 accaaaatag aggaactgct gctgcaattc atcaaggaga caagatcaca tcaaaagagc 1320 

222 acggatgcag ccattcggaa tctagaagtt caaatgggcc aactggcgca tgacaaagcc 1380 

223 gaacggccca ctagaacttt cggtgctaac atggagagaa gaaccccaag gaaggataaa 1440 

224 gcagtactga ctagagggca gagaagagcg caggaggagg gtaaggttga aggagaagac 1500 

225 tggccagaag aaggaaggac agagaagaca gaagaagaag agaaggtggc agaagaacct 1560 

226 aagcgtacca agagccagag agcaagggaa gccaag 1596 

229 <210> SEQ ID NO: 8 

230 <211> LENGTH: 532 

231 <212> TYPE: PRT 

232 <213> ORGANISM: Artificial Sequence 

234 <220> FEATURE: 

235 <223> OTHER INFORMATION: Description of Artificial Sequence: plant 



l6 °0/2900 



236 




retroelement sequence 




















238 


<400> SEQUENCE: 


8 






















Glu 


239 


Met 


Arg 


Gly 


Arg 


Thr 


Ala 


Ser 


Gly 


Asp 


Val 


Val 


Pro 


He 


Asn 


Leu 


240 


1 




5 










10 










15 


Glu 


242 


He 


Glu 


Ala 


Thr 


Cys 


Arg 


Arg 


Asn 


Asn 


Ala 


Ala 


Arg 


Arg 


Arg 


Arg 


243 








20 










25 










30 






245 


Gin 


Asp 


He 


Glu 


Gly 


Ser 


Ser 


Tyr 


Thr 


Ser 


Pro 


Pro 


Pro 


Ser 


Pro 


Asn 


246 




35 










40 










45 








248 


Tyr 


Ala 


Gin 


Met 


Asp 


Gly 


Glu 


Pro 


Ala 


Gin 


Arg 


Val 


Thr 


Leu 


Glu 


Asp 


249 


50 










55 










60 










251 


Phe 


Ser 


Asn 


Thr 


Thr 


Thr 


Pro 


Gin 


Phe 


Phe 


Thr 


Ser 


He 


Thr 


Arg 


Pro 


252 


65 










70 










75 










80 


254 


Glu 


Val 


Gin 


Ala 


Asp 


Leu 


Leu 


Thr 


Gin 


Gly 


Asn 


Leu 


Phe 


His 


Gly 


Leu 


255 










85 










90 










95 




257 


Pro 


Asn 


Glu 


Asp 


Pro 


Tyr 


Ala 


His 


Leu 


Ala 


Ser 


Tyr 


He 


Glu 


He 


Cys 


258 








100 










105 










110 






260 


Ser 


Thr 


Val 


Lys 


He 


Ala 


Gly 


Val 


Pro 


Lys 


Asp 


Ala 


He 


Leu 


Leu 


Asn 
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261 115 120 125 AUG 0l' /ijh 

263 Leu Phe Ser Phe Ser Leu Ala Gly Glu Ala Lys Arg Trp Leu His Ser [£QU /Vi /Tr . 

264 130 135 140 ^C/y /£,■:' -\ • - ? 

266 Phe Lys Gly Asn Ser Leu Arg Thr Trp Glu Glu Val Val Glu Lys Phe 

267 145 150 155 160 

269 Leu Lys Lys Tyr Phe Pro Glu Ser Lys Thr Val Glu Arg Lys Met Glu 

270 165 170 175 

272 He Ser Tyr Phe His Gin Phe Leu Asp Glu Ser Leu Ser Glu Ala Leu 

273 180 185 190 

275 Asp His Phe His Gly Leu Leu Arg Lys Thr Pro Thr His Arg Tyr Ser 

276 195 200 205 

278 Glu Pro Val Gin Leu Asn He Phe lie Asp Asp Leu Gin Leu Leu He 

279 210 215 220 

281 Glu Thr Ala Thr Arg Gly Lys He Lys Leu Lys Thr Pro Glu Glu Ala 

282 225 230 235 240 

284 Met Glu Leu Val Glu Asn Met Ala Ala Ser Asp Gin Ala He Leu His 

285 245 250 255 

287 Asp His Thr Tyr Val Pro Thr Lys Arg Ser Leu Leu Glu Leu Ser Thr 

288 260 265 270 

290 Gin Asp Ala Thr Leu Val Gin Asn Lys Leu Leu Thr Arg Gin He Glu 

291 275 280 285 

293 Ala Leu He Glu Thr Leu Ser Lys Leu Pro Gin Gin Leu Gin Ala He 

294 290 295 300 

296 Ser Ser Ser His Ser Ser Val Leu Gin Val Glu Glu Cys Pro Thr Cys 

297 305 310 315 320 

299 Arg Gly Thr His Glu Pro Gly Gin Cys Ala Ser Gin Gin Asp Pro Ser 

300 325 330 335 

302 Arg Glu Val Asn Tyr He Gly He Leu Asn Arg Tyr Gly Phe Gin Gly 

303 340 345 350 

305 Tyr Asn Gin Gly Asn Pro Ser Gly Phe Asn Gin Gly Ala Thr Arg Phe 

306 355 360 365 

308 Asn His Glu Pro Pro Gly Phe Asn Gin Gly Arg Asn Phe Met Gin Gly 

309 370 375 380 

311 Ser Ser Trp Thr Asn Lys Gly Asn Gin Tyr Lys Glu Gin Arg Asn Gin 

312 385 390 395 400 

314 Pro Pro Tyr Gin Pro Pro Tyr Gin His Pro Ser Gin Gly Pro Asn Gin 

315 405 410 415 

317 Gin Glu Lys Pro Thr Lys He Glu Glu Leu Leu Leu Gin Phe He Lys 

318 420 425 430 

320 Glu Thr Arg Ser His Gin Lys Ser Thr Asp Ala Ala He Arg Asn Leu 

321 435 440 445 

323 Glu Val Gin Met Gly Gin Leu Ala His Asp Lys Ala Glu Arg Pro Thr 

324 450 455 460 

326 Arg Thr Phe Gly Ala Asn Met Glu Arg Arg Thr Pro Arg Lys Asp Lys 

327 465 470 475 480 

329 Ala Val Leu Thr Arg Gly Gin Arg Arg Ala Gin Glu Glu Gly Lys Val 

330 485 490 495 

332 Glu Gly Glu Asp Trp Pro Glu Glu Gly Arg Thr Glu Lys Thr Glu Glu 

333 500 505 510 
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Please Not© ! 

Use of n and/or Xaa have been detected in the Sequence Listing. Please review the 
Sequence Listing to ensure that a corresponding explanation is presented in the <220> 
to <223> fields of each sequence which presents at least one n or Xaa. 

RECEIVED 

AUG 0 i 2002 



Seq*:166; N Pos . 6,15,16,18 
Seq#:168; N Pos. 7 
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